Whole Genome Variant Dataset for Enriching Studies across 18 Different Cancers

نویسندگان

چکیده

Whole genome sequencing (WGS) has helped to revolutionize biology, but the computational challenge remains for extracting valuable inferences from this information. Here, we present cancer-associated variants Cancer Genome Atlas (TCGA) WGS dataset. This set of data will allow cancer researchers further expand their analysis beyond exomic regions entire genome. A total 1342 alignments available consortium were processed with VarScan2 and deposited NCI Cloud. The sample covers 18 different cancers reveals 157,313,519 pooled (non-unique) single-nucleotide variations (SNVs) across all samples. There was an average 117,223 SNVs per sample, a range 1111 775,470 standard deviation 163,273. dataset incorporated into BigQuery, which allows fast access cross-mapping, enrich current studies plethora newly genomic data.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Dataset Generator for Whole Genome Shotgun Sequencing

Simulated data sets have been found to be useful in developing software systems because (1) they allow one to study the effect of a particular phenomenon in isolation, and (2) one has complete information about the true solution against which to measure the results of the software. In developing a software suite for assembling a whole human genome shotgun data set, we have developed a simulator...

متن کامل

Mapping whole genome shotgun sequence and variant

Genomics research in mammals has produced reference genome sequences that are essential for identifying variation associated with disease. High quality reference genome sequences are now available for humans, model species, and economically important agricultural animals. Comparisons between these species have provided unique insights into mammalian gene function. However, the number of speci...

متن کامل

Mapping whole genome shotgun sequence and variant

Genomics research in mammals has produced reference genome sequences that are essential for identifying variation associated with disease. High quality reference genome sequences are now available for humans, model species, and economically important agricultural animals. Comparisons between these species have provided unique insights into mammalian gene function. However, the number of speci...

متن کامل

Mapping whole genome shotgun sequence and variant

Genomics research in mammals has produced reference genome sequences that are essential for identifying variation associated with disease. High quality reference genome sequences are now available for humans, model species, and economically important agricultural animals. Comparisons between these species have provided unique insights into mammalian gene function. However, the number of speci...

متن کامل

Rare variant density across the genome and across populations

Next-generation sequencing allows for a new focus on rare variant density for conducting analyses of association to disease and for narrowing down the genomic regions that show evidence of functionality. In this study we use the 1000 Genomes Project pilot data as distributed by Genetic Analysis Workshop 17 to compare rare variant densities across seven populations. We made the comparisons using...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Onco

سال: 2022

ISSN: ['2673-7523']

DOI: https://doi.org/10.3390/onco2020009